Improving Performances of BoW-based Image Retrieval by Using Contextual Keypoint Descriptors
نویسندگان
چکیده
The paper reports an improved method of content-based image retrieval using a well-known method of bag-of words (BoW). Words built over descriptors of popular affine-invariant keypoint detectors (Harris-Affine and Hessian-Affine are exemplary choices) are used. What is novel, however, is the number of descriptors (i.e. the number of words) representing individual keypoints. Instead of SIFT (or another alternative descriptors SIFT-like descriptors representing both visual properties of keypoints and their local configurations are proposed (adopted from our previous works). In average, each keypoint has 10-15 such descriptors, but the increased size of BoW representation is in our opinion acceptable because of significant performance improvements in BoW-based image retrieval, as shown in a feasibility study on a popular benchmark dataset. Such an improvement is possible because very large vocabularies can be built over the proposed descriptors without compromising the sensitivity of words to minor geometric and photometric distortions.
منابع مشابه
Seeing the Big Picture: Deep Embedding with Contextual Evidences
In the Bag-of-Words (BoW) model based image retrieval task, the precision of visual matching plays a critical role in improving retrieval performance. Conventionally, local cues of a keypoint are employed. However, such strategy does not consider the contextual evidences of a keypoint, a problem which would lead to the prevalence of false matches. To address this problem, this paper defines “tr...
متن کاملExtended Keypoint Description and the Corresponding Improvements in Image Retrieval
The paper evaluates an alternative approach to BoW-based image retrieval in large databases. The major improvements are in the re-ranking step (verification of candidates returned by BoW). We propose a novel keypoint description which allows the verification based only on individual keypoint matching (no spatial consistency over groups of matched keypoints is tested). Standard Harris-Affine and...
متن کاملBIG-OH: BInarization of gradient orientation histograms
a r t i c l e i n f o Extracting local keypoints and keypoint descriptions from images is a primary step for many computer vision and image retrieval applications. In the literature, many researchers have proposed methods for representing local texture around keypoints with varying levels of robustness to photometric and geometric transformations. Gradient-based descriptors such as the Scale In...
متن کاملUniversity of Marburg at TRECVID 2010: Semantic Indexing
In this paper, we summarize our results for the semantic indexing task at TRECVID 2010. Last year, we showed that the use of object detection results as an additional input for SVM-based concept classifiers improved the overall performance. This year, we investigated whether a state-of-the-art bag-of-visual-words (BoW) approach can also be improved by adding object-based features. In this conte...
متن کاملContent-Based Image Retrieval Based on Late Fusion of Binary and Local Descriptors
One of the challenges in Content-Based Image Retrieval (CBIR) is to reduce the semantic gaps between low-level features and high-level semantic concepts. In CBIR, the images are represented in the feature space and the performance of CBIR depends on the type of selected feature representation. Late fusion also known as visual words integration is applied to enhance the performance of image retr...
متن کامل